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THE DISTBIBUTION OF PUBE LINE MEANS 

DE. J. ARTHUR HARRIS 

Carnegie Institution of Washington 

Sevebal times recently we have been told that the 
means of a character in a series of pure lines form a 
' ' Quetelet 's Curve. ' n Some of those responsible for this 
assertion seem to attribute a particular virtue to " Que- 
telet's Law," and to feel that the statement that the 
means of a series of pure lines form a chance curve fur- 
nishes uncontrovertible evidence for the genotype theory 
of heredity. The questions which interest the biologist 
are, first, whether the statement is true in the sense that 
it is made on a sufficient body of actual observations, and 
second, what is the general biological significance to be 
attached to it, if true. 

But among these biologists the interpretation of the 
facts has apparently preceded the demonstration of the 
existence of the facts themselves. Now while it is not at 
all unlikely that the means of genotypes — if such entities 
in Johannsen's sense of the term do exist in nature — 
form a chance curve, it by no means follows that con- 
versely a series of averages which can be arranged in a 
symmetrical variation polygon proves or even suggests 
the existence of differentiated pure lines or biotypes. 
Yet just such differences in means are being accepted 
and cited without criticism as valid evidence in support 
of Johannsen's sweeping generalizations. 

A case in point is a paper by Boemer 2 on pure lines in 
peas. It is with regret that one criticizes Boemer 's 

'Compare, for example, in this connection: Nilsson-Ehle, Hot. Not., 1907, 
pp. 113-140; Lang, Zeitsclir. f. Ind. Abst.- u. Vererbungsl., Vol. 4, pp. 15- 
16, 1910; Spillman, Amer. Nat., Vol. 44, p. 761, 1910; Pearl, Amer. Nat., 
Vol. 45, p. 423, 1911. 

2 Eoemer, T., ' ' Variabilitatsstudien, ' ' Arch. f. Sassen- u. Oesellsch.- 
Biologie, Vol. 7, pp. 397-469, 1910. 
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paper. It is an exceedingly laborious Arbeit and appa- 
rently done with scrupulous care. One who himself has 
experienced the labor of calculating a few tables of con- 
stants has sympathy for a worker who has industriously 
filled pages with them. But the tenability of the geno- 
type theory is one of the most pressing of current evolu- 
tionary problems, and all available evidence must be 
scrutinized. Eoemer's data are chosen for two very 
excellent reasons, the first of which is that of all of the 
men who have discussed the disposition of the means of 
pure lines in a " Quetelet's Curve," he is, so far as I am 
aware, the only one who has put on record sufficient data 
for a critical test of his conclusions. If without over- 
trying the case, as the lawyers have it, we can give the 
second reason, it is that Eoemer's data and conclusions 
have been accepted as perfectly valid by genotype 
specialists. One of them, for example, says: 

The work is essentially a confirmation, with another plant, of 
Johannsen's epoch-making investigations on beans, though it lacks any 
extensive studies on the effect of selection within the pure line. The 
essential objective point of Eoemer's research is rather to determine 
the biometric characteristics of pure lines as such in relation to the 
general population. Among the more important general results are 
the following: 

1. The different biot3'pes in a population arrange themselves in fre- 
quency distributions in accord with Quetelet's Law. 

2. No relation was found to exist between the variability of the 
biotypes (i. e., variation within the general population) and variation 
within the pure lines. 

Our problem is twofold. First, we have to determine 
whether Eoemer is really justified in regarding his lines 
as differentiated. Second, we have to inquire concerning 
the critical value of his data as evidence in support of 
the genotype theory of heredity. Incidently we shall 
make the first of these problems serve as an illustration 
of the use of a coefficient of individual prepotency recently 
proposed in these pages. 3 

3 Harris, J. Arthur, "A Coefficient of Individual Prepotency for Stu- 
dents of Heredity," Amer. Nat., Vol. 45, pp. 471-478. 1911. 
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II. The Peoblem of Diffebentiation in Roemeb's 
Ptjbe Lines 

The method of Roemer's study was very simple. In 
1908 a population of pea plants was grown from a sample 
of ordinary seed. In 1909 the offspring of each of a num- 
ber of these plants was studied separately, and the means 
of several characters calculated. By a comparison of 
selected pairs of these means Roemer concludes that the 
several lines differ from each other, and by a seriation of 
all the line means he obtains the Quetelet 's curve. 

Such evidence as this can not be accepted. Every 
mean calculated on a sample of individuals is more or less 
untrustworthy as a measure of the character in individ- 
uals in general, because of the errors of random sam- 
pling, and in attaching significance to a series of averages 
this fact must be fully taken into account. It can not 
adequately be allowed for by a comparison of selected 
cases with their probable errors. 

First Test. A Comparison of the Variability within the 
"Pure Line" ivith that of the "Population" 

One of the tests of the presence of differentiated 
"biotypes," "genotypes" or "pure lines" within a 
"population" is the comparison of the intra-line with the 
population variability. If both be the same there is no 
justification in the assumption that the population is 
composed of a number of differentiated pure lines. 4 If 
the variability of the population is greater than that of 
the individual lines it may (or may not) comprise a series 
of "genotypes." 

The reason for this is obvious. The standard devia- 
tion within the pure line, o-, describes only the differences 
occurring among the individuals of the group, while 2, 
the standard deviation of the group, includes also the 
amounts by which the several lines are differentiated. 

* This is, of course, under condition that the individuals of the several 
pure lines are not reared under conditions which tend to increase artificially 
their variability beyond, that of the population. 



No. 539] DISTRIBUTION OF PURE LINE MEANS 689 

Eoemer does not give us the population standard 
deviations for the several characters in 1909 but only the 
averages, ot 1; m 2 , m s , ••■, m s , and the standard deviations 
a i> a 2> °"3> '") CT s- We ma y approximate the desired con- 
stants very closely indeed 5 by the following method. 

Let there he s samples or pure lines of n y , n 2 , n 3 , ■ ■ ■, n» 
individuals each, with means h/ 1? m 2 , m 3 , ■■■, m s , and 
standard deviations <r 1 , o- 2 , o- 3 , •••, o- s . These form the 
population S(n) ==N, for which the physical constants 
2 and M are desired. 

The mean is clearly M = S (am) / S (n) . 

In calculating the S.D. we may take the first two 
rough moments, v/, v./, about any point we please and 
adjust by the familiar formula o- 2 = //,, = v./ — v/". If 
the moments be taken about 0° v/ = M, and it is at once 
clear that for the population 



\ S(n) V S(n) ) ' 

when S indicates a summation for all groups or lines. 7 
The population constants have been calculated by these 
formula? for all the characters of Eoemer 's two large 
series. He has given population constants, M and S, 
for the 1908 series, the parents of the 1909 plants. 

The two are conveniently laid side by side for com- 
parison in Table I. 8 The data in hand hardly seem to 
justify detailed comparison with reference to probable 

6 There is no approximation in the formula. The accuracy in practise 
depends solely upon the trustworthiness of the original mi's and <r's, and 
upon the number of decimal places retained in the arithmetical routine. 

G For several advantages in doing this see Amer. Nat., Vol. 44, pp. 693- 
699. 1910. 

' The application of the formula to Eoemer 's data is of course exceed- 
ingly laborious, involving as it does the determination and summation by 
pairs of 3,108 squares, and the summation of the products of their totals 
by the frequencies upon which they are based. The publication of a little 
tabulated data would have reduced many days ' labor necessary for a critical 
test of his results to a few hours. 

8 The constants for 1908 are taken from Eoemer 's Table I. Those for 
1909 are calculated by the formulas given above. 
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errors. It will be noted at once that for all the char- 
acters the mean is higher in 1909 9 — indeed for some 
characters in the "Kapital Erbse" it is almost double 
that found in 1908! With one exception the standard 

TABLE I 

Comparison of Physical Constants for 1908 and 1909 Populations 





Averages 


Standard Deviations 


Coefficient of "Vari- 
ation 


parison 


Gelbe 

Viktoria- 

Erbse 


Svalofs 

Kapital- 

Erbse 


Gelbe 

Viktoria- 

Erbse 


Svalofs 

Kapital- 

Erbse 


Gelbe 

Viktoria- 

Erbse 


Svalofs 

Kapital- 

Erbse 


Weight of Plant 
1908 Population 
1909 


13.09 

20.82 


7.99 
20.39 


4.250 
6.568 


2.815 
7.127 


32.47 
31.54 


35.23 
34.95 


1909/1908 Eatio 
Length of Stem 

1908 Population 

1909 " 


1.59 

114.96 
136.81 


2.55 

78.96 
158.42 


1.545 

12.985 
16.331 


2.531 

12.575 
20.163 


.97 

11.30 
11.93 


.99 

15.95 
12.73 


1909/1908 Eatio 
Thickness of Stem 

1908 Population 

1909 " 


1.19 

24.03 
25.20 

1.05 

4.59 
6.48 


2.01 

20.50 
24.05 


1.257 

2.766 
2.390 


1.603 

2.081 
2.933 


1.06 

11.50 
9.48 


.80 

10.15 
12.19 


1909/1908 Eatio 
Number of Pods 

1908 Population 

1909 " 


1.17 

5.62 
11.54 


.864 

1.364 
1.987 


1.409 

1.805 
4.263 


.82 

20.72 
30.66 


1.20 

32.12 
36.95 


1909/1908 Eatio 
Weight of Pods 

1908 Population 

1909 " 


1.41 

9.71 
11.28 


2.05 

5.76 
10.26 


1.456 

3.192 
4.290 


2.361 

2.184 
4.407 


1.48 

32.80 
38.03 


1.15 

37.95 
42.98 


1909/1908 Eatio 
Number of Seeds 
1908 Population 
1909 


1.16 

19.64 
26.60 


1.78 

24.02 
45.11 


1.343 

6.267 
9.162 


2.017 

8.418 
18.633 


1.16 

31.91 
34.44 


1.13 

35.05 
41.31 


1909/1908 Ratio 
Weight of Seeds 
1908 Population 
1909 


1.35 

7.63 
8.56 


1.88 

4.56 
7.26 


1.461 

2.569 
3.511 


2.213 

1.833 
3.376 


1.08 

33.67 
40.99 


1.18 

40.21 
46.49 


1909/1908 Eatio 


1.12 


1.59 


1.366 


1.841 


1.22 


1.16 



deviations in 1909 are higher than those in 1908. Mean 
and standard deviation are generally closely correlated, 
and this doubtless accounts for the greater variability 
of the 1909 series. Possibly, however, the 1909 plants 



9 Eoemer states that conditions for growth in 1909 were superior to 
those in 1908. 
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were grown under conditions more heterogeneous than 
those to which the 1908 plants were exposed. In the 
second case, the S.D. might he directly raised, i. e., 
heterogeneity in the crop may he merely a reflection of 
heterogeneity in the substratum. 

There is no way of determining whether Eoemer's 
cultural conditions were more heterogeneous in 1909 
than in 1908, but it must 'be noted that in ten of the four- 
teen cases the coefficient of variation is higher in 1909. 

Two ratios are to be examined, 

Mean Pure Line Variability 



Parental Population Variability' 

Me an Pure Line Variability 
General Population Variability * 

Consider first the ratio of the mean pure line to the pa- 
rental population variability. If the offspring of the 
individual parents are differentiated we should expect 
to find the mean variability of the pure lines less than 
that of the parent population, providing, of course, 
that innate tendencies are not obscured by environmental 
factors. Table II 10 gives the necessary data. 

Now the remarkable thing about these standard devia- 
tions is that in the most cases the variability within the 
individual "pure lines" in 1909 is greater than that of 
a mixture of all the pure lines in 1908. The excess is 
very striking in several cases. Of the fourteen com- 
parisons, thirteen show a higher variability within the 
pure line than in the population. For the "Viktoria" 

10 For 1908 the population 2 and C.V. are from Roemer's Table I. The 
1909 population 2 and C.V. have been calculated by the formulas given 
above. The mean pure line standard deviations have been taken from 
Roemer's Tables II-III. None of the constants have been rechecked, since 
the original data are not available. The mean value of 12.13 for thickness 
of stem in Table II is obviously a printer's slip for 2.13. The mean pure 
line coefficients of variation are from Roemer's Table X. These were not 
calculated by dividing the sum of the coefficients of variation of the indi- 
vidual pure lines by the number of lines, but by dividing the mean standard 
deviation of the pure lines by the mean average of the pure lines. 
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tlie average M.P.L.'/Parental ratio is 1.327 while for the 
"Kapital" it is 1.996. The "pure line" variability is 
thus from 30 to 100 per cent, in excess of that of the 
population. 

TABLE II 

Comparisons of Mean Pure Line and Population Variability 



Character and Comparison 



Weight of Stem 

Population 

Mean Pure Line. 



M.P.L./Pop. Eatio 
Length of Stem 

Population 

Mean Pure Line 



M.P.L./Pop. Ratio. 
Thickness of Stem 

Population 

Mean Pure Line 



M.P.L./Pop. Ratio 
Number of Pods 

Population 

Mean Pure Line. 



M. PL. /Pop. Ratio. 
Weight of Pods 

Population 

Mean Pure Line. ... 



M.P.L./Pop. Ratio 
Number of Seeds 

Population 

Mean Pure Line. ... 



M.P.L./Pop. Ratio. . 
Weight of Seeds 

Population 

Mean Pure Line.. .... 



M.P.L./Pop. Ratio.. 



1908-1909. Stand- 
ard Deviations 



Gelbe 

Viktoria- 

Erbse 



4.250 
5.96 



1.402 

12.985 
13.80 

1.063 

2.766 
2.13 



.770 

1.364 
1.80 



1.320 

3.192 
3.80 



1.190 

6.267 
8 01 



1.278 

2.569 
3.17 



1.234 



Svalofs 
Kapital- 

Krbse 



2.815 
6.44 



2.288 

12.575 
15.98 



1.271 

2.081 
2.04 



.980 

1.805 
3.92 



2.172 

2. 184 
3.95 



1.809 

8.418 
16.60 



1.972 

1.833 
3.04 



1.658 



1908-1909. Coeffi- 
cient of Variation 



Gelbe 

Viktoria- 

Erbse 



32.47 
28.53 



11.30 
10.05 



.89 

11.50 

8.45 



29 72 
27.19 



.91 

32.80 
33.84 



1.03 

31.91 
33.30 



1.04 

33.67 
37.05 



1.10 



Svalofs 
Kapital- 

Krbse 



35.23 
31.40 



.89 

15.95 
10.05 



.63 

10.15 

8.46 



.83 

32.12 
34.00 



1.06 

37.95 
38.50 



1.01 

35.05 
36.80 



1.05 

40.21 
41.90 



1.04 



1909-1909. Stand- 
ard Deviations 



Gelbe 

Viktoria- 

Erbse 



6.568 
5.96 



.907 

16.331 
13.80 



.845 

2.390 
2.13 



.891 

1.987 
1.80 



.906 
4.290 



.885 

9.163 
8.01 



.874 

3.511 
3.17 



.903 



Svalofs 
Kapital- 

Ei-bse 



7.127 
6.44 



.904 

20.163 
15.98 



.792 

2.933 
2.04 



.696 

4.263 
3.92 



.920 

4.407 
3.95 



.896 

18.633 
16.60 



.891 

3.376 
3.04 



.900 



The explanation of this anomalous result is first to 
be sought in the higher means (with the associated 
higher variability) in the 1909 plants. Basing the com- 
parison on the coefficients of variation in order to elimi- 
nate, in so far as possible, the influence of the means, 
we note that seven of the ratios are greater and seven 
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are less than unity, while for all the mean is .936. On 
an average, therefore, the pure lines have 93.6 per cent. 
as much variability as the population. 

The second comparison, that between the variability 
of the individual pure lines and the population which 
they form, can be made on the basis of the standard 
deviations alone since the means are the same. This 
comparison (the last two columns of Table IT) shows 
that in both series and for every character the variabil- 
ity written, the line is less than that for the population. 
The lowest ratio is .70, the highest is .92 and the mean 
is .858. This test indicates that they are differentiated. 
This is, of course, the conclusion which Eoemer drew 
from his selected individual comparisons. 

Second Test. The Deviation of the Pure Line Means 
from the Population Mean 

For characters measurable on a quantitative scale the 
test for the deviation of the offspring of an individual 
from its population is given by 



IT- a- ( 2n\ 
(m-JI)±.67449 A | F + -(l-^] 



T- ( o*( t 2n\ n(lU-mf 



Where m and M, <r and 2, n and N are the means, stand- 
ard deviations, and numbers of individuals for the 
family and the population, respectively. 11 

For reasons which will be apparent to the reader 
later, the data which are given us do not justify calcu- 
lations to a high degree of refinement. 12 We therefore 
approximate in every point possible. 

The expressions 

2n n(M-mf 
1 ~Ti' N(N^nj 

"Amer. Nat., Vol. 45, pp. 471-478. 1911. 

12 Furthermore, I have serious misgivings that Eoemer 's lines comprise 
so few individuals each that the coefficient suggested must be used with 
caution. It is not needful to consider the point in greater detail here. 
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may be disregarded, since with such relatively low values 
of n as those of Eoemer's data the first is practically 
unity, while the second is generally insignificant and 
may always be neglected, having a maximum value of 
circa .010, and in the majority of the cases falling far 
towards zero. The values of S 2 /jV are given in Table 
III. In practically every case the inclusion of 5 2 /iV in 

TABLE III 

Values op Z 2 /N 

GelbeViktoria Svalofs Kapital 

Character Erbse Erbse 

1. Weight of plant 0233 .0292 

2. Length of stem 1442 .9204 

3. Thickness of stem 0031 .0049 

4. Number of pods 0021 .0105 

5. Weight of pods 0100 .0112 

6.. Number of seeds 0454 .1997 

7. Weight of seeds 0067 .0066 

formula would change the end result only insignifi- 
cantly, and since we are working roughly it may be 
omitted throughout. 

We have thus reduced the formula to the fundamental 
term yJcPjn. Fortunately for us, Eoemer has not used the 
correct formula for the probable error of the mean but 
has given Johannsen's "Mittlerer Fehler," <r/\/n = 
V^V^ We now simply determine m — M 13 and 
( m — ilf)/(o-/Vn") for each of his lines. Table IV gives 
the ratios. 

Apparently there can be no question concerning the 
reality of differentiation in Eoemer's lines. If the differ- 
ences (m — M) were due purely to random sampling 
from a homogeneous population, we should expect the 

13 This second test was carried out before the first, and so before the 
actual population means and variabilities were available. The means used 
were Eoemer's means of pure line averages, as given in his Table IX. 
These would be the same as the population means if all the lines had the 
same n. As a matter of fact, the agreement is very close in all. In two 
cases only did I modify Eoemer's means — changing number of pods from 
6.6 to 6.5 and weight of pods from 11.2 to 11.3, in the "Gelbe Viktoria 
Erbse. ' ' 
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TABLE IV 



Ratio of Deviation to 
Probable Error 


Character 14 in Gelbe Viktoria 
Erbse 


Character in Svalofs Kapital 
Erbse 


1 


2 


3 | 4 


5 j 6 


7 


1 


2 


3 


4 


5 


6 


7 


— 15. 5 to — 14.5 

— 14.5 to — 13.5 

— 13.5 to — 12.5 

— 12.5 to — 11.5 

— 11.5 to— 10.5 

— 10.5 to— 9.5 

— 9.5 to — 8.5 

— 8.5 to — 7.5 

— 7.5 to— 6.5 

— 6.5 to— 5.5 

— 5.5 to — 4.5 

— 4.5 to— 3.5 

— 3.5 to — 2.5 

— 2.5 to — 1.5 

— 1.5 to — 0.5 

— 0.5 to + 0.5 
+ 0.5 to + 1.5 
+ 1.5 to + 2.5 
+ 2.5 to + 3.5 
+ 3.5 to + 4.5 
+ 4.5 to + 5.5 
+ 5.5 to + 6.5 
+ 6.5 to + 7.5 
+ 7.5 to + 8.5 
+ 8.5 to + 9.5 
+ 9.5 to + 10.5 
+ 10.5 to + 11.5 
+ 11.5 to + 12.5 


4 

1 

6 

13 

29 

24 

25 

8 

5 

1 


2 

3 

5 

12 

12 

19 

13 

21 

14 

6 

2 

3 

2 

1 


3 

8 

18 

19 

29 

13 

10 

9 

4 

1 

1 


1 

4 

5 

13 

24 

31 

23 

9 

2 

2 

1 


1 

3 
1 

4 

15 

24 

28 

22 

9 

5 

1 

1 

1 


3 

4 

3 

18 

25 

22 

24 

13 

1 

2 

1 


2 

4 

5 

14 

25 

24 

23 

12 

4 

1 

1 


2 

5 

7 

8 

19 

26 

26 

9 

3 

1 
1 


1 

1 
1 
1 

1 

1 

6 
2 
8 

15 

18 

25 

13 

7 

3 

1 

1 

1 

1 


1 

2 

2 

15 

25 

30 

12 

13 

3 

2 

1 

1 


1 
1 

2 

5 

1 

7 

15 

36 

21 

12 

4 

2 


1 

5 

4 

10 

24 

17 

28 

14 

2 

1 

1 


1 

1 

2 

4 

11 

19 

22 

27 

15 

4 

1 


1 

1 

1 

3 

6 

13 

16 

19 

30 

12 

4 

1 



ratios (to — Hf)/o- (m _ jV) to have a standard deviation of 
1 ± .67449/ V 2s, where s is the number of lines involved. 
The constants are given in Table V. 



TABLE V 

Character Gelbe Viktoria Erbse 

1. Weight of plant 1.738 ± .077 

2. Length of stem 2.620 ± .117 

3. Thickness of stem 2.008 ± .089 

4. Number of pods 1.717 ± .076 

5. Weight of pods 2.213 ± .098 

6. Number of seeds 1.857 ± .083 

7. Weight of seeds 1.895 ± .084 



Svalofs Kapital Erbse 
1.886 ± .087 
3.209 ± .148 
1.768 ± .082 
1.955 ± .090 
1.839 ± .085 
1.759 ± .081 
2.188 ± .101 



Now we remember that Roemer's individual lines are 
represented by so few individuals that our formula may 
not give absolutely trustworthy results, and that in case 

"Numbers of characters are the same as in Tables III and V. 
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it does not, the discrepancy is in favor of the pure linist. 
But comparing these values with 1.000 ± .044 for "Vik- 
toria" and 1.000 ± .046 for "Kapital," I think we must 
admit that the evidence is strongly in favor of a differ- 
entiation in these lines of peas. 

III. The Beabing of These Data upon the Genotype 
Theoky of Heredity 

To the conclusion that these pure lines of peas are 
differentiated among themselves and that speaking 
roughly their means "form a Quetelet's Curve," there 
can be no objection, although this conclusion is by no 
means justified by Boemer's own analysis of his data. 
The assumption that these facts lend any support what- 
ever to the genotype theory 15 seems to me to rest not only 
upon the most slipshod reasoning, but upon a complete 
disregard of simple biological precautions. 

The chief of the pertinent reasons follow. 

First. There is no Evidence of Line, or Genotypic Con- 
stancy or Heredity. 

By definition the genotype is a rigid organic entity, 
distinguished by breeding true from generation to gen- 
eration, with the exception of mutations which are com- 
pletely inherited and fluctuations which are not inherited 
at all. The mean of the line remains the same from 
generation to generation (except for rises and falls due 
to periodic environmental changes) ; the variations 
around these means are absolutely incapable of inherit- 
ance. 

Boemer and his supporters of course assume this to be 
true for the Pisum series dealt with, but no fragment of 
evidence is adduced to show either (a) that these means 
remain the same from generation to generation, or (b) 
that selection within the pure line is ineffective. The 
condition is even worse than this. Boemer measured the 

13 The cardinal points of the genotype theory have been tersely, and I 
believe fairly, summarized elsewhere in these pages. See Amer. Nat., Vol. 
45, pp. 346-363, 1911. 
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characters of the parent plants which furnished the seed 
from which his various pure lines were grown, but he 
neither determines whether there is any parental corre- 
lation in the population nor gives us the data from which 
this might be determined. To be sure, one may determine 
from his data that the variability within the individual 
line is less than that of the population, but this might be 
as easily attributed to the much-scouted "Gralton's Law" 
as to genotypic heredity; possibly it is to be referred to 
some factor quite outside inheritance. 16 In fine, it is 
impossible to determine anything at all about inheritance 
from Roemer's data. 

Second. The Possibility of Environmental Hetero- 
geneity is not Adequately Taken, into Account. 

One of the first precautions of the experimental 
breeder should be to make sure that the environmental 
conditions to which his materials are subjected are not so 
heterogeneous as to vitiate entirely any conclusions con- 
cerning innate factors. Two watch glasses of Paramenia 
may present highly different environmental conditions. 
Food and housing are important factors in the egg 
records of poultry. Peas differ from row to row, or 
within the row, because of slight differences in the sub- 
stratum. But experimenters now-a-days are so obsessed 
with the idea of rigid "unit characters," "determiners" 
and "genes" that little attention is paid to environmental 
influences; they are so absorbed in "analyzing" the 
"germ plasm" that they forget to make proper allowance 
for factors which may so modify the soma — which is the 
only available index of the germ plasm of an individual 
— that it is misleading as a guide to hereditary tendencies. 

From my own experience in experimental cultures it 
seems quite possible that the differences in Roemer's lines 
are due to lack of uniformity in the substratum. One is 

ie For instance, de Vries has several times suggested that the feeding of 
the parent plant may have an effect on the offspring. I personally do not 
believe that this could be a sensible source of differentiation in Roemer 's 
peas, but it is one of the factors which should be taken into account by a 
critical student of the problem. 
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not much impressed with the consistency of his results. 
Not only are his means and standard deviations much 
higher in 1909 than in 1908, but the relative variabilities 
are also higher in ten out of fourteen cases. To be sure, 
he has used the conventional precautions. He describes 
his field as a "gleichmassiger humoser Lehmboden," and 
states that the fertilizer was mixed with soil before appli- 
cation and distributed as evenly as possible. But for an 
organism so responsive to environmental influences as 
the garden pea, 1T and in a problem of this delicacy, these 
precautions are not at all sufficient. It is quite clear 18 
that the seeds from each parent were planted together 
in rows, and if the soil differed at all from one part of the 
field to another the tendency would be for this hetero- 
geneity to induce a differentiation in the crop. 

If it be urged that we do not know that the differentia- 
tion in Eoemer 's means are due to environmental hetero- 
geneity, the reply is simple. It is the duty of those who 
claim genotypic rank for observed differences to prove 
that their results are not due merely to faulty experi- 
mental conditions. 

Third. One or Both of Roemer's "Populations" are 
in Reality "Pure Lines." 

By definition a pure line is the offspring of a single 
individual of a continually self -fertilizing or vegetatively 
propagating organism. They may or may not be differ- 
entiated. G-enotypists assume that generally they are 
both differentiated and highly constant. Selection within 
the line is absolutely incapable of effecting any change. 

Now the curious thing about Eoemer 's material is that 
at least one, and possibly both, of the two populations 
suitable for our examination are in reality pure lines. 
Whether the "Individualauslese" by which the "Grelbe 
Viktoria Erbse" was developed gave rise to a pure line 
depends largely upon the stringency with which it was 

17 Roemer observes that the pea is exceedingly susceptible to environ- 
mental influences. 

18 See Eoemer, I, c, pp. 404-405. 
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carried out. Concerning "Svalofs Kapital Erbse" there 
can be no doubt. The original plant was selected at 
Svalof in 1896. Whether a further selection within the 
offspring of this plant was made, Eoemer was unable to 
determine, but of the ancestry of his seed he felt quite 
certain. 

Truly this is an anomalous state of affairs ! Analyzed 
by the best available statistical methods, Eoemer 's data 
certainly indicate that the lines studied are significantly 
differentiated. Pure-line specialists dispense with any 
statistical analysis at all and accept the data as "a con- 
firmation ... of Johannsen's epoch-making investigations 
on beans." Yet if the differentiation in these lines be 
due to anything other than faulty experimental condi- 
tions, the observations described destroy entirely the 
value of Johannsen's theory by showing that heritable 
variations may occur in great numbers in the pure line. 

IV. SUMMAET AND CONCLUSIONS 

1. The statement that the means of the pure lines of a 
population form a "Quetelet's Curve" is now being made 
by genotypists. If it is true that an apparently homo- 
geneous population is composed of a large number of 
slightly differentiated genotypes, it seems a priori not 
unlikely that their means will be arranged according to 
"Quetelet's Law." The question which concerns the 
biologist is whether this is, as a matter of fact, the condi- 
tion found in nature. The object of the present rather 
laborious study has been to test the validity of this asser- 
tion on the basis of available facts. Eoemer 's data for 
pure lines in peas are the only passably satisfactory 
published series available. 

2. Such a problem has two phases. It is first necessary 
to determine by adequate statistical tests that the lines 
in question may be reasonably regarded as differentiated 
biologically — i. e., that the differences between them can- 
not be explained as the errors of random sampling, such 
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as give one a low or a high hand at cards. It is then 
allowable to consider the biological interpretation of the 
differences. 

3. Two tests for differentiation were applied: (a) the 
mean intra-line variability was compared with the popu- 
lation variability, and (b) the significance of the devia- 
tion of individual line means from the population mean 
was tested by a coefficient of individual prepotency re- 
cently suggested. Both of these tests indicate sensible 
and statistically significant differences between the lines. 
These differences may be said to be distributed according 
to "Quetelet's Law" as the term is loosely used by 
biologists. 

4. This fact per se furnishes no evidence at all for the 
genotypic nature of the differences in Eoemer's lines. 
Indeed, throughout Eoemer's work there is no conclusive 
evidence of any kind concerning any problem of heredity. 
At least one (and possibly both) of his series of material 
is from his own explicit statements in reality a pure line. 
The difference observed within these lines and considered 
by him and other pure linists to be of genotypic value and 
a confirmation of Johannsen's results with beans are 
probably merely the result of faulty experimental condi- 
tions. If they are not, Eoemer's evidence goes squarely 
against Johannsen's theory. 

Cold Spring Harbor, L. I., 
July 19, 1911. 



